Subspace Distribution Clustering HMM for Chinese Digit Speech Recognition
Abstract
The Hidden Markov Model (HMM) is a statistical method widely used for speech recognition. To train HMMs effectively with much less data, the Subspace Distribution Clustering Hidden Markov Model (SDCHMM), derived from the Continuous Density Hidden Markov Model (CDHMM), is introduced. A new method for training SDCHMMs that exploits parameter tying is described. Compared with the conventional training method, an SDCHMM recognizer trained with the new method achieves higher accuracy and speed. Experimental results show that the SDCHMM recognizer outperforms the CDHMM recognizer on Chinese digit speech recognition.
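The parameter tying behind an SDCHMM can be illustrated with a small sketch. It is not the training method of this paper, which the abstract does not detail. It assumes diagonal-covariance Gaussians, splits the feature dimensions into streams, and replaces the subspace Gaussians in each stream with a small shared codebook. Plain k-means on (mean, log-variance) vectors stands in for whatever clustering criterion the authors use, and the names `kmeans` and `tie_subspace_gaussians`, the stream count, and the codebook size are all hypothetical choices for illustration.

```python
import numpy as np


def kmeans(points, n_clusters, n_iters=20, seed=0):
    """Plain k-means (illustrative stand-in); returns (centroids, labels)."""
    rng = np.random.default_rng(seed)
    start = rng.choice(len(points), size=n_clusters, replace=False)
    centroids = points[start].copy()
    labels = np.zeros(len(points), dtype=int)
    for _ in range(n_iters):
        # Squared Euclidean distance from every point to every centroid.
        dists = ((points[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        for c in range(n_clusters):
            members = points[labels == c]
            if len(members) > 0:  # keep the old centroid if a cluster empties
                centroids[c] = members.mean(axis=0)
    return centroids, labels


def tie_subspace_gaussians(means, variances, n_streams, n_prototypes):
    """Tie diagonal Gaussians stream by stream.

    means, variances: arrays of shape (n_gaussians, dim).
    Returns one codebook per stream and an index table of shape
    (n_gaussians, n_streams) pointing into those codebooks.
    """
    stream_dims = np.array_split(np.arange(means.shape[1]), n_streams)
    codebooks, indices = [], []
    for dims in stream_dims:
        # Describe each subspace Gaussian by its mean and log-variance.
        params = np.hstack([means[:, dims], np.log(variances[:, dims])])
        centroids, labels = kmeans(params, n_prototypes)
        codebooks.append(centroids)
        indices.append(labels)
    return codebooks, np.stack(indices, axis=1)


# Synthetic stand-in for the Gaussians of a trained CDHMM (assumed sizes).
rng = np.random.default_rng(1)
means = rng.normal(size=(200, 12))
variances = rng.uniform(0.5, 2.0, size=(200, 12))

codebooks, index_table = tie_subspace_gaussians(
    means, variances, n_streams=4, n_prototypes=16
)
full_params = means.size + variances.size        # untied parameter count
tied_params = sum(cb.size for cb in codebooks)   # shared prototype parameters
```

With these assumed sizes, the 4800 untied mean and variance values are replaced by 384 prototype values plus a 200-by-4 table of small integer indices, which is the kind of reduction that lets such a model be estimated from less data.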
Similar resources
Memory space reduction for hidden Markov models in low-resource speech recognition systems
Low-cost recognition systems based on hidden Markov models (HMM) for mobile speech recognizers (mobile phones, PDAs) have a limited quantity of memory and processing power. Furthermore, the resources have to be shared between several applications. In this paper, memory-efficient HMMs were investigated for low-cost recognition platforms. The feature parameter tying HMM and subspace distribution c...
Comparison of low footprint acoustic modeling techniques for embedded ASR systems
In this paper we compare the performance of speech recognition systems based on hidden Markov models (HMM) with quantized parameters (qHMMs) and subspace distribution clustering hidden Markov models (SDCHMMs). Both of these HMM types provide performance similar to that of continuous density HMMs, but with significantly reduced memory requirements (approximately 90% less memory was needed to store the H...
Trajectory Clustering Using Longer Length Units for Automatic Speech Recognition
One of the major deficiencies of conventional hidden Markov modelling (HMM) is known as the trajectory folding phenomenon. Multipath models can solve the trajectory folding problem by assuming that a large part of the variation in acoustic data can be attributed to different observation classes, which can then be modelled separately. In this paper, we present an approach to automatically clu...
HMM Based Speech Recognition of Continuous Thai Digits
Progress on speech recognition of Thai digit strings is presented in this paper. HTK 3.0 was chosen to implement the HMM-based speech recognizer. MFCCs and their delta and delta-delta terms were used as speech features. Several sets of HMM parameters were investigated. Two kinds of word searching methods were tried. Recognition accuracy of 98.7% on test data was achieved with a fixed length word...
Isolated Malay Digit Recognition Using Pattern Recognition Fusion of Dynamic Time Warping and Hidden Markov Models
This paper presents a pattern recognition fusion method for isolated Malay digit recognition using Dynamic Time Warping (DTW) and Hidden Markov Model (HMM). The aim of the project is to increase the accuracy percentage of Malay speech recognition. This study proposes an algorithm for pattern recognition fusion of the recognition models. The endpoint detection, framing, normalization, Mel Fre...